Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 143424 |
| Missing cells | 543643 |
| Missing cells (%) | 14.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 28.5 MiB |
| Average record size in memory | 208.0 B |
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 13 |
medical_specialty has a high cardinality: 72 distinct values | High cardinality |
primary_diagnosis_code has a high cardinality: 716 distinct values | High cardinality |
other_diagnosis_codes has a high cardinality: 19374 distinct values | High cardinality |
ndc_code has a high cardinality: 251 distinct values | High cardinality |
encounter_id is highly overall correlated with patient_nbr | High correlation |
patient_nbr is highly overall correlated with encounter_id | High correlation |
race is highly imbalanced (56.8%) | Imbalance |
race has 3309 (2.3%) missing values | Missing |
weight has 139122 (97.0%) missing values | Missing |
payer_code has 54190 (37.8%) missing values | Missing |
medical_specialty has 69463 (48.4%) missing values | Missing |
ndc_code has 23462 (16.4%) missing values | Missing |
max_glu_serum has 136409 (95.1%) missing values | Missing |
A1Cresult has 117650 (82.0%) missing values | Missing |
number_emergency is highly skewed (γ1 = 21.51520047) | Skewed |
number_outpatient has 120027 (83.7%) zeros | Zeros |
number_inpatient has 96698 (67.4%) zeros | Zeros |
number_emergency has 127444 (88.9%) zeros | Zeros |
num_procedures has 65788 (45.9%) zeros | Zeros |
Reproduction
| Analysis started | 2023-03-27 18:05:39.578175 |
|---|---|
| Analysis finished | 2023-03-27 18:06:09.244241 |
| Duration | 29.67 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
encounter_id
Real number (ℝ)
| Distinct | 101766 |
|---|---|
| Distinct (%) | 71.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.6742899 × 108 |
| Minimum | 12522 |
|---|---|
| Maximum | 4.4386722 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 12522 |
|---|---|
| 5-th percentile | 28302468 |
| Q1 | 88295964 |
| median | 1.5476371 × 108 |
| Q3 | 2.3208969 × 108 |
| 95-th percentile | 3.7897561 × 108 |
| Maximum | 4.4386722 × 108 |
| Range | 4.438547 × 108 |
| Interquartile range (IQR) | 1.4379372 × 108 |
Descriptive statistics
| Standard deviation | 1.0190458 × 108 |
|---|---|
| Coefficient of variation (CV) | 0.60864356 |
| Kurtosis | -0.099966293 |
| Mean | 1.6742899 × 108 |
| Median Absolute Deviation (MAD) | 70195368 |
| Skewness | 0.6796994 |
| Sum | 2.4013336 × 1013 |
| Variance | 1.0384543 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 58316058 | 6 | < 0.1% |
| 63415968 | 6 | < 0.1% |
| 63184686 | 6 | < 0.1% |
| 110310714 | 6 | < 0.1% |
| 60016020 | 6 | < 0.1% |
| 205689816 | 5 | < 0.1% |
| 174335250 | 5 | < 0.1% |
| 197878764 | 5 | < 0.1% |
| 184457202 | 5 | < 0.1% |
| 377841854 | 5 | < 0.1% |
| Other values (101756) | 143369 |
| Value | Count | Frequency (%) |
| 12522 | 2 | |
| 15738 | 2 | |
| 16680 | 2 | |
| 28236 | 1 | < 0.1% |
| 35754 | 1 | < 0.1% |
| 36900 | 2 | |
| 40926 | 3 | |
| 42570 | 1 | < 0.1% |
| 55842 | 3 | |
| 62256 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 443867222 | 1 | < 0.1% |
| 443857166 | 3 | |
| 443854148 | 2 | |
| 443847782 | 1 | < 0.1% |
| 443847548 | 2 | |
| 443847176 | 2 | |
| 443842778 | 1 | < 0.1% |
| 443842340 | 1 | < 0.1% |
| 443842136 | 1 | < 0.1% |
| 443842070 | 1 | < 0.1% |
patient_nbr
Real number (ℝ)
| Distinct | 71518 |
|---|---|
| Distinct (%) | 49.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54936079 |
| Minimum | 135 |
|---|---|
| Maximum | 1.8950262 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 135 |
|---|---|
| 5-th percentile | 1596573 |
| Q1 | 23572188 |
| median | 46307830 |
| Q3 | 88236270 |
| 95-th percentile | 1.1168201 × 108 |
| Maximum | 1.8950262 × 108 |
| Range | 1.8950248 × 108 |
| Interquartile range (IQR) | 64664082 |
Descriptive statistics
| Standard deviation | 38578400 |
|---|---|
| Coefficient of variation (CV) | 0.70224159 |
| Kurtosis | -0.32759223 |
| Mean | 54936079 |
| Median Absolute Deviation (MAD) | 31955810 |
| Skewness | 0.46901037 |
| Sum | 7.8791522 × 1012 |
| Variance | 1.4882929 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90609804 | 52 | < 0.1% |
| 91751121 | 50 | < 0.1% |
| 89472402 | 50 | < 0.1% |
| 62352252 | 46 | < 0.1% |
| 84397842 | 41 | < 0.1% |
| 29903877 | 40 | < 0.1% |
| 88785891 | 40 | < 0.1% |
| 90164655 | 38 | < 0.1% |
| 37096866 | 38 | < 0.1% |
| 43140906 | 33 | < 0.1% |
| Other values (71508) | 142996 |
| Value | Count | Frequency (%) |
| 135 | 5 | |
| 378 | 1 | < 0.1% |
| 729 | 1 | < 0.1% |
| 774 | 2 | < 0.1% |
| 927 | 1 | < 0.1% |
| 1152 | 5 | |
| 1305 | 1 | < 0.1% |
| 1314 | 4 | |
| 1629 | 1 | < 0.1% |
| 2025 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 189502619 | 1 | < 0.1% |
| 189481478 | 3 | |
| 189445127 | 4 | |
| 189365864 | 1 | < 0.1% |
| 189351095 | 1 | < 0.1% |
| 189349430 | 1 | < 0.1% |
| 189332087 | 2 | |
| 189298877 | 1 | < 0.1% |
| 189257846 | 3 | |
| 189215762 | 1 | < 0.1% |
race
Categorical
IMBALANCE  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3309 |
| Missing (%) | 2.3% |
| Memory size | 1.1 MiB |
| Caucasian | |
|---|---|
| AfricanAmerican | |
| Hispanic | 2938 |
| Other | 2174 |
| Asian | 888 |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 10.023274 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1404411 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Caucasian |
|---|---|
| 2nd row | Caucasian |
| 3rd row | AfricanAmerican |
| 4th row | Caucasian |
| 5th row | Caucasian |
Common Values
| Value | Count | Frequency (%) |
| Caucasian | 107688 | |
| AfricanAmerican | 26427 | 18.4% |
| Hispanic | 2938 | 2.0% |
| Other | 2174 | 1.5% |
| Asian | 888 | 0.6% |
| (Missing) | 3309 | 2.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| caucasian | 107688 | |
| africanamerican | 26427 | 18.9% |
| hispanic | 2938 | 2.1% |
| other | 2174 | 1.6% |
| asian | 888 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 379744 | |
| i | 167306 | |
| n | 164368 | |
| c | 163480 | |
| s | 111514 | 7.9% |
| C | 107688 | 7.7% |
| u | 107688 | 7.7% |
| r | 55028 | 3.9% |
| A | 53742 | 3.8% |
| e | 28601 | 2.0% |
| Other values (7) | 65252 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1237869 | |
| Uppercase Letter | 166542 | 11.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 379744 | |
| i | 167306 | |
| n | 164368 | |
| c | 163480 | |
| s | 111514 | 9.0% |
| u | 107688 | 8.7% |
| r | 55028 | 4.4% |
| e | 28601 | 2.3% |
| f | 26427 | 2.1% |
| m | 26427 | 2.1% |
| Other values (3) | 7286 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 107688 | |
| A | 53742 | |
| H | 2938 | 1.8% |
| O | 2174 | 1.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1404411 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 379744 | |
| i | 167306 | |
| n | 164368 | |
| c | 163480 | |
| s | 111514 | 7.9% |
| C | 107688 | 7.7% |
| u | 107688 | 7.7% |
| r | 55028 | 3.9% |
| A | 53742 | 3.8% |
| e | 28601 | 2.0% |
| Other values (7) | 65252 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1404411 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 379744 | |
| i | 167306 | |
| n | 164368 | |
| c | 163480 | |
| s | 111514 | 7.9% |
| C | 107688 | 7.7% |
| u | 107688 | 7.7% |
| r | 55028 | 3.9% |
| A | 53742 | 3.8% |
| e | 28601 | 2.0% |
| Other values (7) | 65252 | 4.6% |
gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 1.1 MiB |
| Female | |
|---|---|
| Male |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.0624115 |
| Min length | 4 |
Characters and Unicode
| Total characters | 726046 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Female |
| 3rd row | Female |
| 4th row | Male |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Female | 76185 | |
| Male | 67234 | |
| (Missing) | 5 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| female | 76185 | |
| male | 67234 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 219604 | |
| a | 143419 | |
| l | 143419 | |
| F | 76185 | 10.5% |
| m | 76185 | 10.5% |
| M | 67234 | 9.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 582627 | |
| Uppercase Letter | 143419 | 19.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 219604 | |
| a | 143419 | |
| l | 143419 | |
| m | 76185 | 13.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 76185 | |
| M | 67234 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 726046 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 219604 | |
| a | 143419 | |
| l | 143419 | |
| F | 76185 | 10.5% |
| m | 76185 | 10.5% |
| M | 67234 | 9.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 726046 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 219604 | |
| a | 143419 | |
| l | 143419 | |
| F | 76185 | 10.5% |
| m | 76185 | 10.5% |
| M | 67234 | 9.3% |
age
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| [70-80) | |
|---|---|
| [60-70) | |
| [50-60) | |
| [80-90) | |
| [40-50) | |
| Other values (5) |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.0241103 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1007426 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | [0-10) |
|---|---|
| 2nd row | [10-20) |
| 3rd row | [20-30) |
| 4th row | [30-40) |
| 5th row | [40-50) |
Common Values
| Value | Count | Frequency (%) |
| [70-80) | 36928 | |
| [60-70) | 32741 | |
| [50-60) | 25095 | |
| [80-90) | 23527 | |
| [40-50) | 13729 | 9.6% |
| [30-40) | 4964 | 3.5% |
| [90-100) | 3619 | 2.5% |
| [20-30) | 1927 | 1.3% |
| [10-20) | 733 | 0.5% |
| [0-10) | 161 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 70-80 | 36928 | |
| 60-70 | 32741 | |
| 50-60 | 25095 | |
| 80-90 | 23527 | |
| 40-50 | 13729 | 9.6% |
| 30-40 | 4964 | 3.5% |
| 90-100 | 3619 | 2.5% |
| 20-30 | 1927 | 1.3% |
| 10-20 | 733 | 0.5% |
| 0-10 | 161 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 290467 | |
| [ | 143424 | |
| - | 143424 | |
| ) | 143424 | |
| 7 | 69669 | 6.9% |
| 8 | 60455 | 6.0% |
| 6 | 57836 | 5.7% |
| 5 | 38824 | 3.9% |
| 9 | 27146 | 2.7% |
| 4 | 18693 | 1.9% |
| Other values (3) | 14064 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 577154 | |
| Open Punctuation | 143424 | 14.2% |
| Dash Punctuation | 143424 | 14.2% |
| Close Punctuation | 143424 | 14.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 290467 | |
| 7 | 69669 | 12.1% |
| 8 | 60455 | 10.5% |
| 6 | 57836 | 10.0% |
| 5 | 38824 | 6.7% |
| 9 | 27146 | 4.7% |
| 4 | 18693 | 3.2% |
| 3 | 6891 | 1.2% |
| 1 | 4513 | 0.8% |
| 2 | 2660 | 0.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 143424 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 143424 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 143424 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1007426 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 290467 | |
| [ | 143424 | |
| - | 143424 | |
| ) | 143424 | |
| 7 | 69669 | 6.9% |
| 8 | 60455 | 6.0% |
| 6 | 57836 | 5.7% |
| 5 | 38824 | 3.9% |
| 9 | 27146 | 2.7% |
| 4 | 18693 | 1.9% |
| Other values (3) | 14064 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1007426 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 290467 | |
| [ | 143424 | |
| - | 143424 | |
| ) | 143424 | |
| 7 | 69669 | 6.9% |
| 8 | 60455 | 6.0% |
| 6 | 57836 | 5.7% |
| 5 | 38824 | 3.9% |
| 9 | 27146 | 2.7% |
| 4 | 18693 | 1.9% |
| Other values (3) | 14064 | 1.4% |
weight
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 139122 |
| Missing (%) | 97.0% |
| Memory size | 1.1 MiB |
| [75-100) | |
|---|---|
| [50-75) | |
| [100-125) | |
| [125-150) | |
| [25-50) | 118 |
| Other values (4) | 144 |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.9446769 |
| Min length | 4 |
Characters and Unicode
| Total characters | 34178 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 5 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | [75-100) |
|---|---|
| 2nd row | [50-75) |
| 3rd row | [50-75) |
| 4th row | [0-25) |
| 5th row | [0-25) |
Common Values
| Value | Count | Frequency (%) |
| [75-100) | 1817 | 1.3% |
| [50-75) | 1133 | 0.8% |
| [100-125) | 890 | 0.6% |
| [125-150) | 200 | 0.1% |
| [25-50) | 118 | 0.1% |
| [0-25) | 67 | < 0.1% |
| [150-175) | 55 | < 0.1% |
| [175-200) | 18 | < 0.1% |
| >200 | 4 | < 0.1% |
| (Missing) | 139122 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 75-100 | 1817 | |
| 50-75 | 1133 | |
| 100-125 | 890 | |
| 125-150 | 200 | 4.6% |
| 25-50 | 118 | 2.7% |
| 0-25 | 67 | 1.6% |
| 150-175 | 55 | 1.3% |
| 175-200 | 18 | 0.4% |
| 200 | 4 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7031 | |
| 5 | 5804 | |
| [ | 4298 | |
| - | 4298 | |
| ) | 4298 | |
| 1 | 4125 | |
| 7 | 3023 | |
| 2 | 1297 | 3.8% |
| > | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 21280 | |
| Open Punctuation | 4298 | 12.6% |
| Dash Punctuation | 4298 | 12.6% |
| Close Punctuation | 4298 | 12.6% |
| Math Symbol | 4 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7031 | |
| 5 | 5804 | |
| 1 | 4125 | |
| 7 | 3023 | |
| 2 | 1297 | 6.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 4298 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4298 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4298 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 34178 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7031 | |
| 5 | 5804 | |
| [ | 4298 | |
| - | 4298 | |
| ) | 4298 | |
| 1 | 4125 | |
| 7 | 3023 | |
| 2 | 1297 | 3.8% |
| > | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34178 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7031 | |
| 5 | 5804 | |
| [ | 4298 | |
| - | 4298 | |
| ) | 4298 | |
| 1 | 4125 | |
| 7 | 3023 | |
| 2 | 1297 | 3.8% |
| > | 4 | < 0.1% |
admission_type_id
Real number (ℝ)
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0276941 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4275852 |
|---|---|
| Coefficient of variation (CV) | 0.70404366 |
| Kurtosis | 2.0527903 |
| Mean | 2.0276941 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.5939553 |
| Sum | 290820 |
| Variance | 2.0379995 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 74713 | |
| 3 | 27756 | 19.4% |
| 2 | 26823 | 18.7% |
| 6 | 7015 | 4.9% |
| 5 | 6584 | 4.6% |
| 8 | 488 | 0.3% |
| 7 | 33 | < 0.1% |
| 4 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 74713 | |
| 2 | 26823 | 18.7% |
| 3 | 27756 | 19.4% |
| 4 | 12 | < 0.1% |
| 5 | 6584 | 4.6% |
| 6 | 7015 | 4.9% |
| 7 | 33 | < 0.1% |
| 8 | 488 | 0.3% |
| Value | Count | Frequency (%) |
| 8 | 488 | 0.3% |
| 7 | 33 | < 0.1% |
| 6 | 7015 | 4.9% |
| 5 | 6584 | 4.6% |
| 4 | 12 | < 0.1% |
| 3 | 27756 | 19.4% |
| 2 | 26823 | 18.7% |
| 1 | 74713 |
discharge_disposition_id
Real number (ℝ)
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6553157 |
| Minimum | 1 |
|---|---|
| Maximum | 28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 18 |
| Maximum | 28 |
| Range | 27 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 5.2192793 |
|---|---|
| Coefficient of variation (CV) | 1.4278601 |
| Kurtosis | 6.4193987 |
| Mean | 3.6553157 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.6336162 |
| Sum | 524260 |
| Variance | 27.240876 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 85308 | |
| 3 | 19677 | 13.7% |
| 6 | 18945 | 13.2% |
| 18 | 4658 | 3.2% |
| 22 | 3077 | 2.1% |
| 2 | 2906 | 2.0% |
| 11 | 1911 | 1.3% |
| 5 | 1631 | 1.1% |
| 25 | 1285 | 0.9% |
| 4 | 1090 | 0.8% |
| Other values (16) | 2936 | 2.0% |
| Value | Count | Frequency (%) |
| 1 | 85308 | |
| 2 | 2906 | 2.0% |
| 3 | 19677 | 13.7% |
| 4 | 1090 | 0.8% |
| 5 | 1631 | 1.1% |
| 6 | 18945 | 13.2% |
| 7 | 782 | 0.5% |
| 8 | 147 | 0.1% |
| 9 | 29 | < 0.1% |
| 10 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 28 | 200 | 0.1% |
| 27 | 5 | < 0.1% |
| 25 | 1285 | 0.9% |
| 24 | 65 | < 0.1% |
| 23 | 602 | 0.4% |
| 22 | 3077 | |
| 20 | 4 | < 0.1% |
| 19 | 8 | < 0.1% |
| 18 | 4658 | |
| 17 | 20 | < 0.1% |
admission_source_id
Real number (ℝ)
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.7010961 |
| Minimum | 1 |
|---|---|
| Maximum | 25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 7 |
| Q3 | 7 |
| 95-th percentile | 17 |
| Maximum | 25 |
| Range | 24 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.0645318 |
|---|---|
| Coefficient of variation (CV) | 0.71293866 |
| Kurtosis | 1.753153 |
| Mean | 5.7010961 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.033245 |
| Sum | 817674 |
| Variance | 16.520419 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 80443 | |
| 1 | 42773 | |
| 17 | 9338 | 6.5% |
| 4 | 4467 | 3.1% |
| 6 | 3108 | 2.2% |
| 2 | 1500 | 1.0% |
| 5 | 1048 | 0.7% |
| 20 | 247 | 0.2% |
| 3 | 247 | 0.2% |
| 9 | 185 | 0.1% |
| Other values (7) | 68 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 42773 | |
| 2 | 1500 | 1.0% |
| 3 | 247 | 0.2% |
| 4 | 4467 | 3.1% |
| 5 | 1048 | 0.7% |
| 6 | 3108 | 2.2% |
| 7 | 80443 | |
| 8 | 27 | < 0.1% |
| 9 | 185 | 0.1% |
| 10 | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 25 | 4 | < 0.1% |
| 22 | 21 | < 0.1% |
| 20 | 247 | 0.2% |
| 17 | 9338 | |
| 14 | 2 | < 0.1% |
| 13 | 1 | < 0.1% |
| 11 | 3 | < 0.1% |
| 10 | 10 | < 0.1% |
| 9 | 185 | 0.1% |
| 8 | 27 | < 0.1% |
time_in_hospital
Real number (ℝ)
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4901899 |
| Minimum | 1 |
|---|---|
| Maximum | 14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 11 |
| Maximum | 14 |
| Range | 13 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.9996667 |
|---|---|
| Coefficient of variation (CV) | 0.66804896 |
| Kurtosis | 0.76214813 |
| Mean | 4.4901899 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.1019709 |
| Sum | 644001 |
| Variance | 8.9980003 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 24986 | |
| 2 | 23475 | |
| 4 | 20064 | |
| 1 | 18530 | |
| 5 | 14452 | |
| 6 | 10880 | |
| 7 | 8569 | 6.0% |
| 8 | 6464 | 4.5% |
| 9 | 4432 | 3.1% |
| 10 | 3416 | 2.4% |
| Other values (4) | 8156 | 5.7% |
| Value | Count | Frequency (%) |
| 1 | 18530 | |
| 2 | 23475 | |
| 3 | 24986 | |
| 4 | 20064 | |
| 5 | 14452 | |
| 6 | 10880 | |
| 7 | 8569 | 6.0% |
| 8 | 6464 | 4.5% |
| 9 | 4432 | 3.1% |
| 10 | 3416 | 2.4% |
| Value | Count | Frequency (%) |
| 14 | 1526 | 1.1% |
| 13 | 1807 | 1.3% |
| 12 | 2116 | 1.5% |
| 11 | 2707 | 1.9% |
| 10 | 3416 | 2.4% |
| 9 | 4432 | 3.1% |
| 8 | 6464 | |
| 7 | 8569 | |
| 6 | 10880 | |
| 5 | 14452 |
payer_code
Categorical
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 54190 |
| Missing (%) | 37.8% |
| Memory size | 1.1 MiB |
| MC | |
|---|---|
| HM | |
| SP | |
| BC | |
| MD | |
| Other values (12) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 178468 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | MC |
|---|---|
| 2nd row | MC |
| 3rd row | MC |
| 4th row | MC |
| 5th row | MC |
Common Values
| Value | Count | Frequency (%) |
| MC | 46532 | |
| HM | 8784 | 6.1% |
| SP | 7613 | 5.3% |
| BC | 6991 | 4.9% |
| MD | 4983 | 3.5% |
| CP | 3687 | 2.6% |
| UN | 3665 | 2.6% |
| CM | 2971 | 2.1% |
| OG | 1532 | 1.1% |
| PO | 919 | 0.6% |
| Other values (7) | 1557 | 1.1% |
| (Missing) | 54190 |
Length
| Value | Count | Frequency (%) |
| mc | 46532 | |
| hm | 8784 | 9.8% |
| sp | 7613 | 8.5% |
| bc | 6991 | 7.8% |
| md | 4983 | 5.6% |
| cp | 3687 | 4.1% |
| un | 3665 | 4.1% |
| cm | 2971 | 3.3% |
| og | 1532 | 1.7% |
| po | 919 | 1.0% |
| Other values (7) | 1557 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 64149 | |
| C | 60619 | |
| P | 12341 | 6.9% |
| H | 8992 | 5.0% |
| S | 7692 | 4.3% |
| B | 6991 | 3.9% |
| D | 5740 | 3.2% |
| U | 3665 | 2.1% |
| N | 3665 | 2.1% |
| O | 2611 | 1.5% |
| Other values (6) | 2003 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 178468 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 64149 | |
| C | 60619 | |
| P | 12341 | 6.9% |
| H | 8992 | 5.0% |
| S | 7692 | 4.3% |
| B | 6991 | 3.9% |
| D | 5740 | 3.2% |
| U | 3665 | 2.1% |
| N | 3665 | 2.1% |
| O | 2611 | 1.5% |
| Other values (6) | 2003 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 178468 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 64149 | |
| C | 60619 | |
| P | 12341 | 6.9% |
| H | 8992 | 5.0% |
| S | 7692 | 4.3% |
| B | 6991 | 3.9% |
| D | 5740 | 3.2% |
| U | 3665 | 2.1% |
| N | 3665 | 2.1% |
| O | 2611 | 1.5% |
| Other values (6) | 2003 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 178468 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 64149 | |
| C | 60619 | |
| P | 12341 | 6.9% |
| H | 8992 | 5.0% |
| S | 7692 | 4.3% |
| B | 6991 | 3.9% |
| D | 5740 | 3.2% |
| U | 3665 | 2.1% |
| N | 3665 | 2.1% |
| O | 2611 | 1.5% |
| Other values (6) | 2003 | 1.1% |
medical_specialty
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 72 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 69463 |
| Missing (%) | 48.4% |
| Memory size | 1.1 MiB |
| InternalMedicine | |
|---|---|
| Emergency/Trauma | |
| Family/GeneralPractice | |
| Cardiology | |
| Surgery-General | |
| Other values (67) |
Length
| Max length | 36 |
|---|---|
| Median length | 33 |
| Mean length | 15.970038 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1181160 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Pediatrics-Endocrinology |
|---|---|
| 2nd row | InternalMedicine |
| 3rd row | InternalMedicine |
| 4th row | Family/GeneralPractice |
| 5th row | Family/GeneralPractice |
Common Values
| Value | Count | Frequency (%) |
| InternalMedicine | 20403 | 14.2% |
| Emergency/Trauma | 11595 | 8.1% |
| Family/GeneralPractice | 10508 | 7.3% |
| Cardiology | 7473 | 5.2% |
| Surgery-General | 4387 | 3.1% |
| Orthopedics | 2236 | 1.6% |
| Nephrology | 1918 | 1.3% |
| Orthopedics-Reconstructive | 1867 | 1.3% |
| Radiologist | 1611 | 1.1% |
| Pulmonology | 1334 | 0.9% |
| Other values (62) | 10629 | 7.4% |
| (Missing) | 69463 |
Length
| Value | Count | Frequency (%) |
| internalmedicine | 20403 | |
| emergency/trauma | 11595 | |
| family/generalpractice | 10508 | |
| cardiology | 7473 | 10.1% |
| surgery-general | 4387 | 5.9% |
| orthopedics | 2236 | 3.0% |
| nephrology | 1918 | 2.6% |
| orthopedics-reconstructive | 1867 | 2.5% |
| radiologist | 1611 | 2.2% |
| pulmonology | 1334 | 1.8% |
| Other values (62) | 10629 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 149664 | |
| r | 111049 | 9.4% |
| a | 102350 | 8.7% |
| n | 96975 | 8.2% |
| i | 89015 | 7.5% |
| c | 71841 | 6.1% |
| l | 68476 | 5.8% |
| y | 49970 | 4.2% |
| t | 48161 | 4.1% |
| o | 47944 | 4.1% |
| Other values (33) | 345715 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1007746 | |
| Uppercase Letter | 140263 | 11.9% |
| Other Punctuation | 23480 | 2.0% |
| Dash Punctuation | 9671 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 149664 | |
| r | 111049 | |
| a | 102350 | |
| n | 96975 | |
| i | 89015 | |
| c | 71841 | |
| l | 68476 | 6.8% |
| y | 49970 | 5.0% |
| t | 48161 | 4.8% |
| o | 47944 | 4.8% |
| Other values (13) | 172301 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 20991 | |
| I | 20470 | |
| G | 16620 | |
| P | 14767 | |
| T | 12839 | |
| E | 11944 | |
| F | 10524 | |
| C | 8912 | |
| S | 7574 | 5.4% |
| O | 6071 | 4.3% |
| Other values (7) | 9551 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 23435 | |
| & | 45 | 0.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9671 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1148009 | |
| Common | 33151 | 2.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 149664 | |
| r | 111049 | 9.7% |
| a | 102350 | 8.9% |
| n | 96975 | 8.4% |
| i | 89015 | 7.8% |
| c | 71841 | 6.3% |
| l | 68476 | 6.0% |
| y | 49970 | 4.4% |
| t | 48161 | 4.2% |
| o | 47944 | 4.2% |
| Other values (30) | 312564 |
Common
| Value | Count | Frequency (%) |
| / | 23435 | |
| - | 9671 | |
| & | 45 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1181160 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 149664 | |
| r | 111049 | 9.4% |
| a | 102350 | 8.7% |
| n | 96975 | 8.2% |
| i | 89015 | 7.5% |
| c | 71841 | 6.1% |
| l | 68476 | 5.8% |
| y | 49970 | 4.2% |
| t | 48161 | 4.1% |
| o | 47944 | 4.1% |
| Other values (33) | 345715 |
primary_diagnosis_code
Categorical
| Distinct | 716 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 33 |
| Missing (%) | < 0.1% |
| Memory size | 1.1 MiB |
| 414 | 9473 |
|---|---|
| 428 | 9385 |
| 786 | 5432 |
| 486 | 5226 |
| 410 | 5076 |
| Other values (711) |
Length
| Max length | 6 |
|---|---|
| Median length | 3 |
| Mean length | 3.1709522 |
| Min length | 1 |
Characters and Unicode
| Total characters | 454686 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 63 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 250.83 |
|---|---|
| 2nd row | 276 |
| 3rd row | 648 |
| 4th row | 8 |
| 5th row | 197 |
Common Values
| Value | Count | Frequency (%) |
| 414 | 9473 | 6.6% |
| 428 | 9385 | 6.5% |
| 786 | 5432 | 3.8% |
| 486 | 5226 | 3.6% |
| 410 | 5076 | 3.5% |
| 427 | 3921 | 2.7% |
| 491 | 3572 | 2.5% |
| 715 | 3514 | 2.5% |
| 682 | 3206 | 2.2% |
| 434 | 3071 | 2.1% |
| Other values (706) | 91515 |
Length
| Value | Count | Frequency (%) |
| 414 | 9473 | 6.6% |
| 428 | 9385 | 6.5% |
| 786 | 5432 | 3.8% |
| 486 | 5226 | 3.6% |
| 410 | 5076 | 3.5% |
| 427 | 3921 | 2.7% |
| 491 | 3572 | 2.5% |
| 715 | 3514 | 2.5% |
| 682 | 3206 | 2.2% |
| 434 | 3071 | 2.1% |
| Other values (706) | 91515 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 79420 | |
| 2 | 56849 | |
| 8 | 53197 | |
| 5 | 50818 | |
| 1 | 39934 | |
| 7 | 39829 | |
| 0 | 34947 | |
| 6 | 32215 | |
| 9 | 28400 | 6.2% |
| 3 | 24992 | 5.5% |
| Other values (3) | 14085 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 440601 | |
| Other Punctuation | 11699 | 2.6% |
| Uppercase Letter | 2386 | 0.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 79420 | |
| 2 | 56849 | |
| 8 | 53197 | |
| 5 | 50818 | |
| 1 | 39934 | |
| 7 | 39829 | |
| 0 | 34947 | |
| 6 | 32215 | |
| 9 | 28400 | 6.4% |
| 3 | 24992 | 5.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 2384 | |
| E | 2 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11699 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 452300 | |
| Latin | 2386 | 0.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 79420 | |
| 2 | 56849 | |
| 8 | 53197 | |
| 5 | 50818 | |
| 1 | 39934 | |
| 7 | 39829 | |
| 0 | 34947 | |
| 6 | 32215 | |
| 9 | 28400 | 6.3% |
| 3 | 24992 | 5.5% |
Latin
| Value | Count | Frequency (%) |
| V | 2384 | |
| E | 2 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 454686 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 79420 | |
| 2 | 56849 | |
| 8 | 53197 | |
| 5 | 50818 | |
| 1 | 39934 | |
| 7 | 39829 | |
| 0 | 34947 | |
| 6 | 32215 | |
| 9 | 28400 | 6.2% |
| 3 | 24992 | 5.5% |
| Other values (3) | 14085 | 3.1% |
other_diagnosis_codes
Categorical
| Distinct | 19374 |
|---|---|
| Distinct (%) | 13.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| 250|401 | 3637 |
|---|---|
| 401|250 | 3060 |
| 276|276 | 968 |
| 414|250 | 922 |
| 428|427 | 911 |
| Other values (19369) |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 7.2870091 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1045132 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7908 ? |
|---|---|
| Unique (%) | 5.5% |
Sample
| 1st row | ?|? |
|---|---|
| 2nd row | 250.01|255 |
| 3rd row | 250|V27 |
| 4th row | 250.43|403 |
| 5th row | 157|250 |
Common Values
| Value | Count | Frequency (%) |
| 250|401 | 3637 | 2.5% |
| 401|250 | 3060 | 2.1% |
| 276|276 | 968 | 0.7% |
| 414|250 | 922 | 0.6% |
| 428|427 | 911 | 0.6% |
| 250|272 | 867 | 0.6% |
| 403|585 | 805 | 0.6% |
| 276|250 | 740 | 0.5% |
| 414|401 | 719 | 0.5% |
| 250|? | 689 | 0.5% |
| Other values (19364) | 130106 |
Length
| Value | Count | Frequency (%) |
| 250|401 | 3637 | 2.5% |
| 401|250 | 3060 | 2.1% |
| 276|276 | 968 | 0.7% |
| 414|250 | 922 | 0.6% |
| 428|427 | 911 | 0.6% |
| 250|272 | 867 | 0.6% |
| 403|585 | 805 | 0.6% |
| 276|250 | 740 | 0.5% |
| 250 | 719 | 0.5% |
| 414|401 | 719 | 0.5% |
| Other values (19337) | 130076 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 144879 | |
| | | 143424 | |
| 4 | 140989 | |
| 5 | 110514 | |
| 0 | 105491 | |
| 7 | 77580 | |
| 8 | 73866 | |
| 1 | 72404 | |
| 9 | 55576 | 5.3% |
| 6 | 50950 | 4.9% |
| Other values (5) | 69459 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 871198 | |
| Math Symbol | 143424 | 13.7% |
| Other Punctuation | 19917 | 1.9% |
| Uppercase Letter | 10593 | 1.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 144879 | |
| 4 | 140989 | |
| 5 | 110514 | |
| 0 | 105491 | |
| 7 | 77580 | |
| 8 | 73866 | |
| 1 | 72404 | |
| 9 | 55576 | 6.4% |
| 6 | 50950 | 5.8% |
| 3 | 38949 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 17593 | |
| ? | 2324 | 11.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 7706 | |
| E | 2887 | 27.3% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 143424 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1034539 | |
| Latin | 10593 | 1.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 144879 | |
| | | 143424 | |
| 4 | 140989 | |
| 5 | 110514 | |
| 0 | 105491 | |
| 7 | 77580 | |
| 8 | 73866 | |
| 1 | 72404 | |
| 9 | 55576 | 5.4% |
| 6 | 50950 | 4.9% |
| Other values (3) | 58866 |
Latin
| Value | Count | Frequency (%) |
| V | 7706 | |
| E | 2887 | 27.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1045132 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 144879 | |
| | | 143424 | |
| 4 | 140989 | |
| 5 | 110514 | |
| 0 | 105491 | |
| 7 | 77580 | |
| 8 | 73866 | |
| 1 | 72404 | |
| 9 | 55576 | 5.3% |
| 6 | 50950 | 4.9% |
| Other values (5) | 69459 |
number_outpatient
Real number (ℝ)
| Distinct | 39 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.36242888 |
| Minimum | 0 |
|---|---|
| Maximum | 42 |
| Zeros | 120027 |
| Zeros (%) | 83.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 42 |
| Range | 42 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.2492946 |
|---|---|
| Coefficient of variation (CV) | 3.4470062 |
| Kurtosis | 162.7107 |
| Mean | 0.36242888 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.1746431 |
| Sum | 51981 |
| Variance | 1.560737 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 120027 | |
| 1 | 11976 | 8.4% |
| 2 | 5128 | 3.6% |
| 3 | 2808 | 2.0% |
| 4 | 1501 | 1.0% |
| 5 | 749 | 0.5% |
| 6 | 415 | 0.3% |
| 7 | 203 | 0.1% |
| 8 | 133 | 0.1% |
| 9 | 108 | 0.1% |
| Other values (29) | 376 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 120027 | |
| 1 | 11976 | 8.4% |
| 2 | 5128 | 3.6% |
| 3 | 2808 | 2.0% |
| 4 | 1501 | 1.0% |
| 5 | 749 | 0.5% |
| 6 | 415 | 0.3% |
| 7 | 203 | 0.1% |
| 8 | 133 | 0.1% |
| 9 | 108 | 0.1% |
| Value | Count | Frequency (%) |
| 42 | 1 | < 0.1% |
| 40 | 2 | < 0.1% |
| 39 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 37 | 2 | < 0.1% |
| 36 | 7 | |
| 35 | 3 | |
| 34 | 1 | < 0.1% |
| 33 | 2 | < 0.1% |
| 29 | 2 | < 0.1% |
number_inpatient
Real number (ℝ)
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.60085481 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 96698 |
| Zeros (%) | 67.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.207934 |
|---|---|
| Coefficient of variation (CV) | 2.0103591 |
| Kurtosis | 21.136635 |
| Mean | 0.60085481 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.6429012 |
| Sum | 86177 |
| Variance | 1.4591044 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 96698 | |
| 1 | 27427 | 19.1% |
| 2 | 10194 | 7.1% |
| 3 | 4472 | 3.1% |
| 4 | 2120 | 1.5% |
| 5 | 1031 | 0.7% |
| 6 | 597 | 0.4% |
| 7 | 334 | 0.2% |
| 8 | 179 | 0.1% |
| 9 | 141 | 0.1% |
| Other values (11) | 231 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 96698 | |
| 1 | 27427 | 19.1% |
| 2 | 10194 | 7.1% |
| 3 | 4472 | 3.1% |
| 4 | 2120 | 1.5% |
| 5 | 1031 | 0.7% |
| 6 | 597 | 0.4% |
| 7 | 334 | 0.2% |
| 8 | 179 | 0.1% |
| 9 | 141 | 0.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 18 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 6 | < 0.1% |
| 15 | 9 | < 0.1% |
| 14 | 14 | < 0.1% |
| 13 | 22 | < 0.1% |
| 12 | 38 | |
| 11 | 57 |
number_emergency
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1950859 |
| Minimum | 0 |
|---|---|
| Maximum | 76 |
| Zeros | 127444 |
| Zeros (%) | 88.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 76 |
| Range | 76 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.92041018 |
|---|---|
| Coefficient of variation (CV) | 4.7179739 |
| Kurtosis | 1038.0835 |
| Mean | 0.1950859 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 21.5152 |
| Sum | 27980 |
| Variance | 0.8471549 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 127444 | |
| 1 | 10897 | 7.6% |
| 2 | 2867 | 2.0% |
| 3 | 964 | 0.7% |
| 4 | 491 | 0.3% |
| 5 | 252 | 0.2% |
| 6 | 129 | 0.1% |
| 7 | 88 | 0.1% |
| 8 | 59 | < 0.1% |
| 9 | 48 | < 0.1% |
| Other values (23) | 185 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 127444 | |
| 1 | 10897 | 7.6% |
| 2 | 2867 | 2.0% |
| 3 | 964 | 0.7% |
| 4 | 491 | 0.3% |
| 5 | 252 | 0.2% |
| 6 | 129 | 0.1% |
| 7 | 88 | 0.1% |
| 8 | 59 | < 0.1% |
| 9 | 48 | < 0.1% |
| Value | Count | Frequency (%) |
| 76 | 1 | < 0.1% |
| 64 | 1 | < 0.1% |
| 63 | 1 | < 0.1% |
| 54 | 1 | < 0.1% |
| 46 | 3 | |
| 42 | 1 | < 0.1% |
| 37 | 3 | |
| 29 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 25 | 3 |
num_lab_procedures
Real number (ℝ)
| Distinct | 118 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.255745 |
| Minimum | 1 |
|---|---|
| Maximum | 132 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 32 |
| median | 44 |
| Q3 | 57 |
| 95-th percentile | 73 |
| Maximum | 132 |
| Range | 131 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 19.657319 |
|---|---|
| Coefficient of variation (CV) | 0.45444412 |
| Kurtosis | -0.23026881 |
| Mean | 43.255745 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.25735981 |
| Sum | 6203912 |
| Variance | 386.41019 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 4656 | 3.2% |
| 43 | 4011 | 2.8% |
| 44 | 3608 | 2.5% |
| 45 | 3387 | 2.4% |
| 38 | 3175 | 2.2% |
| 46 | 3107 | 2.2% |
| 40 | 3086 | 2.2% |
| 41 | 2990 | 2.1% |
| 47 | 2981 | 2.1% |
| 42 | 2928 | 2.0% |
| Other values (108) | 109495 |
| Value | Count | Frequency (%) |
| 1 | 4656 | |
| 2 | 1545 | 1.1% |
| 3 | 961 | 0.7% |
| 4 | 532 | 0.4% |
| 5 | 427 | 0.3% |
| 6 | 392 | 0.3% |
| 7 | 420 | 0.3% |
| 8 | 495 | 0.3% |
| 9 | 1265 | 0.9% |
| 10 | 1143 | 0.8% |
| Value | Count | Frequency (%) |
| 132 | 1 | < 0.1% |
| 129 | 1 | < 0.1% |
| 126 | 1 | < 0.1% |
| 121 | 1 | < 0.1% |
| 120 | 2 | < 0.1% |
| 118 | 1 | < 0.1% |
| 114 | 2 | < 0.1% |
| 113 | 6 | |
| 111 | 3 | |
| 109 | 6 |
number_diagnoses
Real number (ℝ)
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.4244338 |
| Minimum | 1 |
|---|---|
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 6 |
| median | 8 |
| Q3 | 9 |
| 95-th percentile | 9 |
| Maximum | 16 |
| Range | 15 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.9248717 |
|---|---|
| Coefficient of variation (CV) | 0.25926175 |
| Kurtosis | -0.106884 |
| Mean | 7.4244338 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.86753017 |
| Sum | 1064842 |
| Variance | 3.7051311 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 69427 | |
| 5 | 16215 | 11.3% |
| 8 | 15296 | 10.7% |
| 7 | 14724 | 10.3% |
| 6 | 14170 | 9.9% |
| 4 | 7891 | 5.5% |
| 3 | 3916 | 2.7% |
| 2 | 1369 | 1.0% |
| 1 | 259 | 0.2% |
| 16 | 63 | < 0.1% |
| Other values (6) | 94 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 259 | 0.2% |
| 2 | 1369 | 1.0% |
| 3 | 3916 | 2.7% |
| 4 | 7891 | 5.5% |
| 5 | 16215 | 11.3% |
| 6 | 14170 | 9.9% |
| 7 | 14724 | 10.3% |
| 8 | 15296 | 10.7% |
| 9 | 69427 | |
| 10 | 22 | < 0.1% |
| Value | Count | Frequency (%) |
| 16 | 63 | < 0.1% |
| 15 | 15 | < 0.1% |
| 14 | 9 | < 0.1% |
| 13 | 21 | < 0.1% |
| 12 | 10 | < 0.1% |
| 11 | 17 | < 0.1% |
| 10 | 22 | < 0.1% |
| 9 | 69427 | |
| 8 | 15296 | 10.7% |
| 7 | 14724 | 10.3% |
num_medications
Real number (ℝ)
| Distinct | 75 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.776035 |
| Minimum | 1 |
|---|---|
| Maximum | 81 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 11 |
| median | 15 |
| Q3 | 21 |
| 95-th percentile | 32 |
| Maximum | 81 |
| Range | 80 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 8.3971304 |
|---|---|
| Coefficient of variation (CV) | 0.50054322 |
| Kurtosis | 3.6180322 |
| Mean | 16.776035 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 1.3776779 |
| Sum | 2406086 |
| Variance | 70.511799 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 8399 | 5.9% |
| 15 | 8315 | 5.8% |
| 12 | 8227 | 5.7% |
| 14 | 8069 | 5.6% |
| 16 | 7888 | 5.5% |
| 11 | 7756 | 5.4% |
| 17 | 7097 | 4.9% |
| 10 | 7028 | 4.9% |
| 18 | 6635 | 4.6% |
| 9 | 6351 | 4.4% |
| Other values (65) | 67659 |
| Value | Count | Frequency (%) |
| 1 | 262 | 0.2% |
| 2 | 478 | 0.3% |
| 3 | 942 | 0.7% |
| 4 | 1563 | 1.1% |
| 5 | 2316 | 1.6% |
| 6 | 3163 | |
| 7 | 4232 | |
| 8 | 5432 | |
| 9 | 6351 | |
| 10 | 7028 |
| Value | Count | Frequency (%) |
| 81 | 2 | < 0.1% |
| 79 | 3 | < 0.1% |
| 75 | 6 | |
| 74 | 3 | < 0.1% |
| 72 | 6 | |
| 70 | 4 | < 0.1% |
| 69 | 9 | |
| 68 | 10 | |
| 67 | 14 | |
| 66 | 8 |
num_procedures
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3490211 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 65788 |
| Zeros (%) | 45.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.7191041 |
|---|---|
| Coefficient of variation (CV) | 1.2743345 |
| Kurtosis | 0.822533 |
| Mean | 1.3490211 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.3112832 |
| Sum | 193482 |
| Variance | 2.9553189 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 65788 | |
| 1 | 29039 | |
| 2 | 17788 | 12.4% |
| 3 | 13252 | 9.2% |
| 6 | 7277 | 5.1% |
| 4 | 5951 | 4.1% |
| 5 | 4329 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 65788 | |
| 1 | 29039 | |
| 2 | 17788 | 12.4% |
| 3 | 13252 | 9.2% |
| 4 | 5951 | 4.1% |
| 5 | 4329 | 3.0% |
| 6 | 7277 | 5.1% |
| Value | Count | Frequency (%) |
| 6 | 7277 | 5.1% |
| 5 | 4329 | 3.0% |
| 4 | 5951 | 4.1% |
| 3 | 13252 | 9.2% |
| 2 | 17788 | 12.4% |
| 1 | 29039 | |
| 0 | 65788 |
ndc_code
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 251 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 23462 |
| Missing (%) | 16.4% |
| Memory size | 1.1 MiB |
| 68071-1701 | |
|---|---|
| 47918-902 | |
| 47918-898 | |
| 0173-0861 | 4060 |
| 50090-0353 | 4040 |
| Other values (246) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.2123589 |
| Min length | 9 |
Characters and Unicode
| Total characters | 1105133 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 68071-1701 |
|---|---|
| 2nd row | 0378-1110 |
| 3rd row | 68071-1701 |
| 4th row | 0049-4110 |
| 5th row | 68071-1701 |
Common Values
| Value | Count | Frequency (%) |
| 68071-1701 | 20770 | |
| 47918-902 | 20379 | 14.2% |
| 47918-898 | 6568 | 4.6% |
| 0173-0861 | 4060 | 2.8% |
| 50090-0353 | 4040 | 2.8% |
| 0049-4110 | 3431 | 2.4% |
| 0009-3449 | 2501 | 1.7% |
| 0173-0863 | 2305 | 1.6% |
| 0378-1110 | 2208 | 1.5% |
| 0049-4120 | 2183 | 1.5% |
| Other values (241) | 51517 | |
| (Missing) | 23462 |
Length
| Value | Count | Frequency (%) |
| 68071-1701 | 20770 | |
| 47918-902 | 20379 | 17.0% |
| 47918-898 | 6568 | 5.5% |
| 0173-0861 | 4060 | 3.4% |
| 50090-0353 | 4040 | 3.4% |
| 0049-4110 | 3431 | 2.9% |
| 0009-3449 | 2501 | 2.1% |
| 0173-0863 | 2305 | 1.9% |
| 0378-1110 | 2208 | 1.8% |
| 0049-4120 | 2183 | 1.8% |
| Other values (241) | 51517 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 198776 | |
| 1 | 167028 | |
| - | 119962 | |
| 7 | 112993 | |
| 8 | 105508 | |
| 9 | 102969 | |
| 4 | 80297 | |
| 2 | 63767 | 5.8% |
| 3 | 63398 | 5.7% |
| 6 | 51220 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 985171 | |
| Dash Punctuation | 119962 | 10.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 198776 | |
| 1 | 167028 | |
| 7 | 112993 | |
| 8 | 105508 | |
| 9 | 102969 | |
| 4 | 80297 | |
| 2 | 63767 | 6.5% |
| 3 | 63398 | 6.4% |
| 6 | 51220 | 5.2% |
| 5 | 39215 | 4.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 119962 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1105133 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 198776 | |
| 1 | 167028 | |
| - | 119962 | |
| 7 | 112993 | |
| 8 | 105508 | |
| 9 | 102969 | |
| 4 | 80297 | |
| 2 | 63767 | 5.8% |
| 3 | 63398 | 5.7% |
| 6 | 51220 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1105133 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 198776 | |
| 1 | 167028 | |
| - | 119962 | |
| 7 | 112993 | |
| 8 | 105508 | |
| 9 | 102969 | |
| 4 | 80297 | |
| 2 | 63767 | 5.8% |
| 3 | 63398 | 5.7% |
| 6 | 51220 | 4.6% |
max_glu_serum
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 136409 |
| Missing (%) | 95.1% |
| Memory size | 1.1 MiB |
| Norm | |
|---|---|
| >200 | |
| >300 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 28060 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | >300 |
|---|---|
| 2nd row | >300 |
| 3rd row | Norm |
| 4th row | Norm |
| 5th row | Norm |
Common Values
| Value | Count | Frequency (%) |
| Norm | 3220 | 2.2% |
| >200 | 2043 | 1.4% |
| >300 | 1752 | 1.2% |
| (Missing) | 136409 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| norm | 3220 | |
| 200 | 2043 | |
| 300 | 1752 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7590 | |
| > | 3795 | |
| N | 3220 | |
| o | 3220 | |
| r | 3220 | |
| m | 3220 | |
| 2 | 2043 | 7.3% |
| 3 | 1752 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11385 | |
| Lowercase Letter | 9660 | |
| Math Symbol | 3795 | 13.5% |
| Uppercase Letter | 3220 | 11.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7590 | |
| 2 | 2043 | 17.9% |
| 3 | 1752 | 15.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3220 | |
| r | 3220 | |
| m | 3220 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 3795 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3220 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15180 | |
| Latin | 12880 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7590 | |
| > | 3795 | |
| 2 | 2043 | 13.5% |
| 3 | 1752 | 11.5% |
Latin
| Value | Count | Frequency (%) |
| N | 3220 | |
| o | 3220 | |
| r | 3220 | |
| m | 3220 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28060 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7590 | |
| > | 3795 | |
| N | 3220 | |
| o | 3220 | |
| r | 3220 | |
| m | 3220 | |
| 2 | 2043 | 7.3% |
| 3 | 1752 | 6.2% |
A1Cresult
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 117650 |
| Missing (%) | 82.0% |
| Memory size | 1.1 MiB |
| >8 | |
|---|---|
| Norm | |
| >7 |
Length
| Max length | 4 |
|---|---|
| Median length | 2 |
| Mean length | 2.5396912 |
| Min length | 2 |
Characters and Unicode
| Total characters | 65458 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | >7 |
|---|---|
| 2nd row | >7 |
| 3rd row | >7 |
| 4th row | >8 |
| 5th row | Norm |
Common Values
| Value | Count | Frequency (%) |
| >8 | 13110 | 9.1% |
| Norm | 6955 | 4.8% |
| >7 | 5709 | 4.0% |
| (Missing) | 117650 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 8 | 13110 | |
| norm | 6955 | |
| 7 | 5709 |
Most occurring characters
| Value | Count | Frequency (%) |
| > | 18819 | |
| 8 | 13110 | |
| N | 6955 | 10.6% |
| o | 6955 | 10.6% |
| r | 6955 | 10.6% |
| m | 6955 | 10.6% |
| 7 | 5709 | 8.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20865 | |
| Math Symbol | 18819 | |
| Decimal Number | 18819 | |
| Uppercase Letter | 6955 | 10.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 6955 | |
| r | 6955 | |
| m | 6955 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 13110 | |
| 7 | 5709 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 18819 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 6955 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 37638 | |
| Latin | 27820 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 6955 | |
| o | 6955 | |
| r | 6955 | |
| m | 6955 |
Common
| Value | Count | Frequency (%) |
| > | 18819 | |
| 8 | 13110 | |
| 7 | 5709 | 15.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 65458 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| > | 18819 | |
| 8 | 13110 | |
| N | 6955 | 10.6% |
| o | 6955 | 10.6% |
| r | 6955 | 10.6% |
| m | 6955 | 10.6% |
| 7 | 5709 | 8.7% |
change
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Ch | |
|---|---|
| No |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 286848 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No |
|---|---|
| 2nd row | Ch |
| 3rd row | No |
| 4th row | Ch |
| 5th row | Ch |
Common Values
| Value | Count | Frequency (%) |
| Ch | 88669 | |
| No | 54755 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ch | 88669 | |
| no | 54755 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 88669 | |
| h | 88669 | |
| N | 54755 | |
| o | 54755 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 143424 | |
| Lowercase Letter | 143424 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 88669 | |
| N | 54755 |
Lowercase Letter
| Value | Count | Frequency (%) |
| h | 88669 | |
| o | 54755 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 286848 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 88669 | |
| h | 88669 | |
| N | 54755 | |
| o | 54755 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 286848 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 88669 | |
| h | 88669 | |
| N | 54755 | |
| o | 54755 |
readmitted
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| NO | |
|---|---|
| >30 | |
| <30 |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.4614012 |
| Min length | 2 |
Characters and Unicode
| Total characters | 353024 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NO |
|---|---|
| 2nd row | >30 |
| 3rd row | NO |
| 4th row | NO |
| 5th row | NO |
Common Values
| Value | Count | Frequency (%) |
| NO | 77248 | |
| >30 | 50434 | |
| <30 | 15742 | 11.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| no | 77248 | |
| 30 | 66176 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 77248 | |
| O | 77248 | |
| 3 | 66176 | |
| 0 | 66176 | |
| > | 50434 | |
| < | 15742 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 154496 | |
| Decimal Number | 132352 | |
| Math Symbol | 66176 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 77248 | |
| O | 77248 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 66176 | |
| 0 | 66176 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 50434 | |
| < | 15742 | 23.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 198528 | |
| Latin | 154496 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 66176 | |
| 0 | 66176 | |
| > | 50434 | |
| < | 15742 | 7.9% |
Latin
| Value | Count | Frequency (%) |
| N | 77248 | |
| O | 77248 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 353024 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 77248 | |
| O | 77248 | |
| 3 | 66176 | |
| 0 | 66176 | |
| > | 50434 | |
| < | 15742 | 4.5% |
| encounter_id | patient_nbr | admission_type_id | discharge_disposition_id | admission_source_id | time_in_hospital | number_outpatient | number_inpatient | number_emergency | num_lab_procedures | number_diagnoses | num_medications | num_procedures | race | gender | age | weight | payer_code | medical_specialty | max_glu_serum | A1Cresult | change | readmitted | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| encounter_id | 1.000 | 0.530 | -0.117 | -0.054 | -0.050 | -0.055 | 0.137 | 0.035 | 0.123 | -0.005 | 0.290 | 0.103 | -0.034 | 0.074 | 0.014 | 0.036 | 0.071 | 0.070 | 0.177 | 0.179 | 0.128 | 0.115 | 0.074 |
| patient_nbr | 0.530 | 1.000 | 0.011 | -0.036 | 0.029 | -0.013 | 0.144 | 0.028 | 0.108 | 0.031 | 0.239 | 0.042 | -0.021 | 0.110 | 0.035 | 0.039 | 0.014 | 0.131 | 0.183 | 0.159 | 0.124 | 0.117 | 0.119 |
| admission_type_id | -0.117 | 0.011 | 1.000 | 0.033 | -0.393 | -0.007 | 0.034 | -0.038 | -0.028 | -0.219 | -0.117 | 0.100 | 0.220 | 0.069 | 0.022 | 0.039 | 0.107 | 0.103 | 0.287 | 0.123 | 0.072 | 0.065 | 0.041 |
| discharge_disposition_id | -0.054 | -0.036 | 0.033 | 1.000 | 0.034 | 0.275 | 0.038 | 0.080 | 0.009 | 0.054 | 0.151 | 0.172 | 0.026 | 0.025 | 0.036 | 0.057 | 0.035 | 0.049 | 0.133 | 0.069 | 0.041 | 0.091 | 0.115 |
| admission_source_id | -0.050 | 0.029 | -0.393 | 0.034 | 1.000 | -0.001 | 0.021 | 0.049 | 0.099 | 0.136 | 0.106 | -0.074 | -0.205 | 0.050 | 0.019 | 0.036 | 0.135 | 0.053 | 0.274 | 0.153 | 0.043 | 0.025 | 0.055 |
| time_in_hospital | -0.055 | -0.013 | -0.007 | 0.275 | -0.001 | 1.000 | -0.016 | 0.086 | -0.001 | 0.339 | 0.235 | 0.466 | 0.197 | 0.014 | 0.038 | 0.042 | 0.040 | 0.037 | 0.100 | 0.146 | 0.025 | 0.113 | 0.047 |
| number_outpatient | 0.137 | 0.144 | 0.034 | 0.038 | 0.021 | -0.016 | 1.000 | 0.148 | 0.168 | -0.026 | 0.108 | 0.067 | -0.023 | 0.016 | 0.009 | 0.007 | 0.000 | 0.020 | 0.030 | 0.021 | 0.025 | 0.008 | 0.029 |
| number_inpatient | 0.035 | 0.028 | -0.038 | 0.080 | 0.049 | 0.086 | 0.148 | 1.000 | 0.218 | 0.030 | 0.133 | 0.083 | -0.066 | 0.007 | 0.012 | 0.045 | 0.007 | 0.030 | 0.047 | 0.066 | 0.015 | 0.009 | 0.124 |
| number_emergency | 0.123 | 0.108 | -0.028 | 0.009 | 0.099 | -0.001 | 0.168 | 0.218 | 1.000 | 0.001 | 0.089 | 0.037 | -0.047 | 0.000 | 0.014 | 0.029 | 0.000 | 0.030 | 0.039 | 0.010 | 0.011 | 0.011 | 0.031 |
| num_lab_procedures | -0.005 | 0.031 | -0.219 | 0.054 | 0.136 | 0.339 | -0.026 | 0.030 | 0.001 | 1.000 | 0.171 | 0.251 | 0.034 | 0.046 | 0.023 | 0.023 | 0.052 | 0.039 | 0.134 | 0.151 | 0.030 | 0.057 | 0.030 |
| number_diagnoses | 0.290 | 0.239 | -0.117 | 0.151 | 0.106 | 0.235 | 0.108 | 0.133 | 0.089 | 0.171 | 1.000 | 0.289 | 0.071 | 0.056 | 0.009 | 0.120 | 0.091 | 0.075 | 0.154 | 0.055 | 0.109 | 0.040 | 0.084 |
| num_medications | 0.103 | 0.042 | 0.100 | 0.172 | -0.074 | 0.466 | 0.067 | 0.083 | 0.037 | 0.251 | 0.289 | 1.000 | 0.363 | 0.039 | 0.054 | 0.062 | 0.030 | 0.041 | 0.166 | 0.142 | 0.035 | 0.252 | 0.064 |
| num_procedures | -0.034 | -0.021 | 0.220 | 0.026 | -0.205 | 0.197 | -0.023 | -0.066 | -0.047 | 0.034 | 0.071 | 0.363 | 1.000 | 0.027 | 0.063 | 0.061 | 0.053 | 0.042 | 0.225 | 0.050 | 0.027 | 0.027 | 0.038 |
| race | 0.074 | 0.110 | 0.069 | 0.025 | 0.050 | 0.014 | 0.016 | 0.007 | 0.000 | 0.046 | 0.056 | 0.039 | 0.027 | 1.000 | 0.073 | 0.099 | 0.038 | 0.105 | 0.130 | 0.028 | 0.060 | 0.022 | 0.026 |
| gender | 0.014 | 0.035 | 0.022 | 0.036 | 0.019 | 0.038 | 0.009 | 0.012 | 0.014 | 0.023 | 0.009 | 0.054 | 0.063 | 0.073 | 1.000 | 0.106 | 0.217 | 0.108 | 0.154 | 0.000 | 0.038 | 0.021 | 0.020 |
| age | 0.036 | 0.039 | 0.039 | 0.057 | 0.036 | 0.042 | 0.007 | 0.045 | 0.029 | 0.023 | 0.120 | 0.062 | 0.061 | 0.099 | 0.106 | 1.000 | 0.149 | 0.193 | 0.294 | 0.119 | 0.174 | 0.070 | 0.040 |
| weight | 0.071 | 0.014 | 0.107 | 0.035 | 0.135 | 0.040 | 0.000 | 0.007 | 0.000 | 0.052 | 0.091 | 0.030 | 0.053 | 0.038 | 0.217 | 0.149 | 1.000 | 0.091 | 0.088 | 0.000 | 0.087 | 0.092 | 0.057 |
| payer_code | 0.070 | 0.131 | 0.103 | 0.049 | 0.053 | 0.037 | 0.020 | 0.030 | 0.030 | 0.039 | 0.075 | 0.041 | 0.042 | 0.105 | 0.108 | 0.193 | 0.091 | 1.000 | 0.114 | 0.146 | 0.187 | 0.090 | 0.065 |
| medical_specialty | 0.177 | 0.183 | 0.287 | 0.133 | 0.274 | 0.100 | 0.030 | 0.047 | 0.039 | 0.134 | 0.154 | 0.166 | 0.225 | 0.130 | 0.154 | 0.294 | 0.088 | 0.114 | 1.000 | 0.117 | 0.126 | 0.153 | 0.108 |
| max_glu_serum | 0.179 | 0.159 | 0.123 | 0.069 | 0.153 | 0.146 | 0.021 | 0.066 | 0.010 | 0.151 | 0.055 | 0.142 | 0.050 | 0.028 | 0.000 | 0.119 | 0.000 | 0.146 | 0.117 | 1.000 | 0.386 | 0.235 | 0.056 |
| A1Cresult | 0.128 | 0.124 | 0.072 | 0.041 | 0.043 | 0.025 | 0.025 | 0.015 | 0.011 | 0.030 | 0.109 | 0.035 | 0.027 | 0.060 | 0.038 | 0.174 | 0.087 | 0.187 | 0.126 | 0.386 | 1.000 | 0.173 | 0.010 |
| change | 0.115 | 0.117 | 0.065 | 0.091 | 0.025 | 0.113 | 0.008 | 0.009 | 0.011 | 0.057 | 0.040 | 0.252 | 0.027 | 0.022 | 0.021 | 0.070 | 0.092 | 0.090 | 0.153 | 0.235 | 0.173 | 1.000 | 0.034 |
| readmitted | 0.074 | 0.119 | 0.041 | 0.115 | 0.055 | 0.047 | 0.029 | 0.124 | 0.031 | 0.030 | 0.084 | 0.064 | 0.038 | 0.026 | 0.020 | 0.040 | 0.057 | 0.065 | 0.108 | 0.056 | 0.010 | 0.034 | 1.000 |
| encounter_id | patient_nbr | race | gender | age | weight | admission_type_id | discharge_disposition_id | admission_source_id | time_in_hospital | payer_code | medical_specialty | primary_diagnosis_code | other_diagnosis_codes | number_outpatient | number_inpatient | number_emergency | num_lab_procedures | number_diagnoses | num_medications | num_procedures | ndc_code | max_glu_serum | A1Cresult | change | readmitted | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2278392 | 8222157 | Caucasian | Female | [0-10) | NaN | 6 | 25 | 1 | 1 | NaN | Pediatrics-Endocrinology | 250.83 | ?|? | 0 | 0 | 0 | 41 | 1 | 1 | 0 | NaN | NaN | NaN | No | NO |
| 1 | 149190 | 55629189 | Caucasian | Female | [10-20) | NaN | 1 | 1 | 7 | 3 | NaN | NaN | 276 | 250.01|255 | 0 | 0 | 0 | 59 | 9 | 18 | 0 | 68071-1701 | NaN | NaN | Ch | >30 |
| 2 | 64410 | 86047875 | AfricanAmerican | Female | [20-30) | NaN | 1 | 1 | 7 | 2 | NaN | NaN | 648 | 250|V27 | 2 | 1 | 0 | 11 | 6 | 13 | 5 | 0378-1110 | NaN | NaN | No | NO |
| 3 | 500364 | 82442376 | Caucasian | Male | [30-40) | NaN | 1 | 1 | 7 | 2 | NaN | NaN | 8 | 250.43|403 | 0 | 0 | 0 | 44 | 7 | 16 | 1 | 68071-1701 | NaN | NaN | Ch | NO |
| 4 | 16680 | 42519267 | Caucasian | Male | [40-50) | NaN | 1 | 1 | 7 | 1 | NaN | NaN | 197 | 157|250 | 0 | 0 | 0 | 51 | 5 | 8 | 0 | 0049-4110 | NaN | NaN | Ch | NO |
| 5 | 16680 | 42519267 | Caucasian | Male | [40-50) | NaN | 1 | 1 | 7 | 1 | NaN | NaN | 197 | 157|250 | 0 | 0 | 0 | 51 | 5 | 8 | 0 | 68071-1701 | NaN | NaN | Ch | NO |
| 6 | 35754 | 82637451 | Caucasian | Male | [50-60) | NaN | 2 | 1 | 2 | 3 | NaN | NaN | 414 | 411|250 | 0 | 0 | 0 | 31 | 9 | 16 | 6 | 47918-902 | NaN | NaN | No | >30 |
| 7 | 55842 | 84259809 | Caucasian | Male | [60-70) | NaN | 3 | 1 | 2 | 4 | NaN | NaN | 414 | 411|V45 | 0 | 0 | 0 | 70 | 7 | 21 | 1 | 35208-001 | NaN | NaN | Ch | NO |
| 8 | 55842 | 84259809 | Caucasian | Male | [60-70) | NaN | 3 | 1 | 2 | 4 | NaN | NaN | 414 | 411|V45 | 0 | 0 | 0 | 70 | 7 | 21 | 1 | 16729-001 | NaN | NaN | Ch | NO |
| 9 | 55842 | 84259809 | Caucasian | Male | [60-70) | NaN | 3 | 1 | 2 | 4 | NaN | NaN | 414 | 411|V45 | 0 | 0 | 0 | 70 | 7 | 21 | 1 | 47918-891 | NaN | NaN | Ch | NO |
| encounter_id | patient_nbr | race | gender | age | weight | admission_type_id | discharge_disposition_id | admission_source_id | time_in_hospital | payer_code | medical_specialty | primary_diagnosis_code | other_diagnosis_codes | number_outpatient | number_inpatient | number_emergency | num_lab_procedures | number_diagnoses | num_medications | num_procedures | ndc_code | max_glu_serum | A1Cresult | change | readmitted | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 143414 | 443847176 | 50375628 | AfricanAmerican | Female | [60-70) | NaN | 1 | 1 | 7 | 6 | DM | NaN | 345 | 438|412 | 3 | 2 | 1 | 45 | 9 | 25 | 1 | 50090-0353 | NaN | NaN | Ch | >30 |
| 143415 | 443847548 | 100162476 | AfricanAmerican | Male | [70-80) | NaN | 1 | 3 | 7 | 3 | MC | NaN | 250.13 | 291|458 | 0 | 0 | 0 | 51 | 9 | 16 | 0 | 42708-009 | NaN | >8 | Ch | >30 |
| 143416 | 443847548 | 100162476 | AfricanAmerican | Male | [70-80) | NaN | 1 | 3 | 7 | 3 | MC | NaN | 250.13 | 291|458 | 0 | 0 | 0 | 51 | 9 | 16 | 0 | 68071-1701 | NaN | >8 | Ch | >30 |
| 143417 | 443847782 | 74694222 | AfricanAmerican | Female | [80-90) | NaN | 1 | 4 | 5 | 5 | MC | NaN | 560 | 276|787 | 0 | 1 | 0 | 33 | 9 | 18 | 3 | 68071-1701 | NaN | NaN | No | NO |
| 143418 | 443854148 | 41088789 | Caucasian | Male | [70-80) | NaN | 1 | 1 | 7 | 1 | MC | NaN | 38 | 590|296 | 1 | 0 | 0 | 53 | 13 | 9 | 0 | 10631-019 | NaN | NaN | Ch | NO |
| 143419 | 443854148 | 41088789 | Caucasian | Male | [70-80) | NaN | 1 | 1 | 7 | 1 | MC | NaN | 38 | 590|296 | 1 | 0 | 0 | 53 | 13 | 9 | 0 | 47918-902 | NaN | NaN | Ch | NO |
| 143420 | 443857166 | 31693671 | Caucasian | Female | [80-90) | NaN | 2 | 3 | 7 | 10 | MC | Surgery-General | 996 | 285|998 | 0 | 1 | 0 | 45 | 9 | 21 | 2 | 0049-4110 | NaN | NaN | Ch | NO |
| 143421 | 443857166 | 31693671 | Caucasian | Female | [80-90) | NaN | 2 | 3 | 7 | 10 | MC | Surgery-General | 996 | 285|998 | 0 | 1 | 0 | 45 | 9 | 21 | 2 | 0781-5421 | NaN | NaN | Ch | NO |
| 143422 | 443857166 | 31693671 | Caucasian | Female | [80-90) | NaN | 2 | 3 | 7 | 10 | MC | Surgery-General | 996 | 285|998 | 0 | 1 | 0 | 45 | 9 | 21 | 2 | 47918-902 | NaN | NaN | Ch | NO |
| 143423 | 443867222 | 175429310 | Caucasian | Male | [70-80) | NaN | 1 | 1 | 7 | 6 | NaN | NaN | 530 | 530|787 | 0 | 0 | 0 | 13 | 9 | 3 | 3 | NaN | NaN | NaN | No | NO |